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Abstract 

Almost every biological, economic and social system is a complex 
adaptive system (CAS). Mathematical and computer models are rele- 
vant to CAS. Some approaches to modeling CAS are given. Applica- 
tions in vaccination and the immune system are studied. Mathemati- 
cal topics motivated by CAS are discussed. 
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1 Basics of complex adaptive systems (CAS) 

Definition (1): A complex adaptive system consists of inhomogeneous, in- 
teracting adaptive agents. Adaptive means capable of learning. 

Definition (2): An emergent property of a CAS is a property of the sys- 
tem as a whole which does not exist at the individual elements (agents) level. 
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Typical examples are the brain, the immune system, the economy, social 
systems, ecology, insects swarm, etc.. 

Therefore to understand a complex system one has to study the system 
as a whole and not to decompose it into its constituents. This totalistic 
approach is against the standard reductionist one, which tries to decompose 
any system to its constituents and hopes that by understanding the elements 
one can understand the whole system. 

2 Why should we study complex adaptive sys- 
tems? 

Most of living systems are CAS. Moreover they have intrinsic unpredictability 
which causes some "seemingly wise" decisions to have harmful side effects. 
Therefore we should try to understand CAS to try to minimize such side 
effects. Here we give two examples of these side effects. 

Mathematical models have played important roles in understanding the 
impact of vaccination programs. The complications of infectious diseases 
spread make the problem of predicting the impact of vaccinations a nonlin- 
ear problem. Sometimes a counter-intuitive result appears e.g. the threshold 
phenomena [Edelstein-Keshet 1988]. Here another example will be men- 
tioned. 

Several vaccination programs are known e.g. mass vaccination where all 
population is vaccinated, target vaccination where only a certain group is 
vaccinated. 

If one tries to understand the expected impact of a vaccination program 
one should take the following points into account: 

1. Vaccination is not perfect hence a probability of vaccination failure 
should be assumed. 

2. Sometimes vaccination takes time to be effective. 

3. Immunity is waning i.e. may be lost with time. 

4. Long range contacts can play a significant role e.g. SARS (severe acute 
respiratory syndrome) has been transmitted between countries via air 
travellers. 



2 



Rubella is a mild viral infectious disease. Typically it is most dangerous 
when infecting a pregnant female where it has severe effects on the fetus. 
Once one gets it he (she) gets a life long immunity. There are several vac- 
cination strategies for rubella [Vynnycky et al 2003]. The US policy is to 
vaccinate all two years old children. The UK policy is to vaccinate only 14- 
years old girls. Another strategy which is adopted in some underdeveloped 
countries is not to vaccinate at all. It has been found [Jazbec et al 2003] that 
in most cases the UK strategy is equal or better than the US one despite 
being cheaper. 

An interesting situation arose when some countries adopted a private 
sector vaccination to MMR (Measles, Mumps and Rubella) [Vynnycky et al 
2003]. It was expected that the number of Congenital Rubella Syndrome 
(CRS) will decrease. However it did not and in some countries (e.g. Greece 
and Costs Rica) it increased. The reason can be understood as follows: This 
vaccination to part of the population decreases the probability of contracting 
the disease at young age. Hence the number of susceptible individuals at 
adulthood increases. Consequently the probability of contracting the disease 
at adulthood increases. This is an example of the counterintuitive effects of 
some vaccination programs. 

Another example for bad side effects is Lake Victoria [Chu et al 2003] 
where a new species called Nile perch was introduced expecting that it is 
more economically profitable. Yet the following results have appeared: 

(i) The local fishermen's tools were not suitable for the new fish hence only 
large corporations benefited. 

(ii) Due to its higher price the locals were unavailable to buy the new type. 

(iii) The original fish used to eat the larva of mosquitoes but now mosquitoes' 
numbers have increased significantly thus the quality of life of the locals 
have deteriorated!! 

There are at least two sources for unpredictability in CAS. The first is 
the nonlinear interactions between its agents [West 1990]. The second is 
that CAS are open systems hence perturbation to one system may affect 
another related one e.g. perturbation to Lake Victoria affected the number 
of mosquitoes. 
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3 How to model a CAS? 



The standard approaches are 

1. Ordinary differential equations (ODE), difference equations and partial 
differential equations (PDE). 

2. Cellular automata (CA) [Ilachinski 2001]. 

3. Evolutionary game theory [Hofbauer and Sigmund 1998]. 

4. Agent based models. 

5. Networks [Watts and Strogatz 1998] etc.. 

6. Fractional calculus [Stanislavsky 2000]. 

Some of these approaches are included in [Boccara 2004]. 
The ODE and PDE approaches have some difficulties as follows [Louzon 
et al 2003]: 

(i) ODE and PDE assumes that local fluctuations have been smoothed 
out. 

(ii) Typically they neglect correlations between movements of different 
species. 

(hi) They assume instantaneous results of interactions. 

Most biological systems show delay and do not satisfy the above assump- 
tions. They concluded that a cellular automata (CA) [Ilachinski 2001] type 
system called microscopic simulation is more suitable to model complex bio- 
logical systems. We agree that CA type systems are more suitable to model 
complex biological systems but such systems suffer from a main drawback 
namely the difficulty of obtaining analytical results. The known analytical 
results about CA type systems are very few compared to the known results 
about ODE and PDE. Some mathematical results about CA are given in the 
appendix. 

Now we present a compromise i.e. a PDE which avoids the delay and the 
correlations drawbacks. It is called telegraph reaction diffusion equations 
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[Ahmed and Hassan 2000]. To overcome the non-delay weakness in Fick's 
law it is replaced by 



dJ(x,t) „ dc 
J(,, t ) + T-Lli = -D__, (!) 

where the flux J(x, t) relaxes, with some given characteristic time constant 
t and c is the concentration of the diffusing substance. Combining Eq. (1) 
with the equation of continuity, one obtains the modified diffusion equation 
or the Telegraph equation: 

dc d 2 c £)® 2c (2) 
dt dx 2 dx 2 
The corresponding Telegraph reaction diffusion (TRD) is given by 

where /(c) is a polynomial in c. 

Another motivation for TRD comes from media with memory where the 
flux J is related to the density c(x, t) through a relaxation function K(t) as 
follows 



J( x ,t) = - f K(t-t) 
Jo 



dx 



It can be shown [Compte & Metzler 1997] that, with a suitable choice for the 
kernel K(t), the standard Telegraph equation is obtained. 

A third motivation is that starting from discrete space time one does not 
obtain the standard diffusion equation but the telegraph equation [Chopard 
and Droz 1991]. 

Moreover it is known that TRD results from correlated random walk 
[Diekmann et al, 2000] . This supports the conclusion that Telegraph reaction 
diffusion equation is more suitable for modeling complex systems than the 
usual diffusion one. 
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4 The immune system as a complex system 
[Segel and Cohen 2001, Ahmed and Hashish 
2004] 

The emergent properties of the immune system (IS) included: 

* The ability to distinguish any substance (typically called antigen Ag) 
and determine whether it is damaging or not. If Ag is non-damaging 
(damaging) then, typically, IS tolerates it (responds to it). 

* If it decides to respond to it then IS determines whether to eradicate 
it or to contain it. 

* The ability to memorize most previously encountered Ag, which enables 
it to mount a more effective reaction in any future encounters. This is 
the basis of vaccination processes. 

* IS is complex thus it has a network structure. 

* The immune network is not homogeneous since there are effectors with 
many connections and others with low number of connections. 

* The Ag, which enters our bodies, has extremely wide diversity. Thus 
mechanisms have to exist to produce immune effectors with constantly 
changing random specificity to be able to recognize these Ag. Conse- 
quently IS is an adaptive complex system. 

* Having said that, one should notice that the wide diversity of IS con- 
tains the danger of autoimmunity (attacking the body). Thus mecha- 
nisms that limit autoimmunity should exist. 

* In addition to the primary clonal deletion mechanism, two further bril- 
liant mechanisms exist: The first is that the IS network is a threshold 
or "window" one i.e. no activation exists if the Ag quantity is too low 
or too high (This is called low and high zone tolerance). 

* Thus an auto reactive immune effector (i.e. an immune effector that 
attacks the body to which it belongs) will face so many self-antigens 
that it has to be suppressed due to the high zone tolerance mechanism. 
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* Another mechanism against autoimmunity is the second signal given by 
antigen presenting cells (APC). If the immune effector is self reactive 
then, in most cases, it does not receive the second signal thus it becomes 
anergic. 

* Also long term memory can be explained by the phenomena of high 
and low zone tolerance where IS tolerates Ag if its quantity is too high 
or too low. So persisting Ag is possible and continuous activation of 
immune effectors may occur. 

* There is another possible explanation for long term memory using the 
immune system (Extremal Dynamics). 

* Thus design principles of IS can explain important phenomena of IS. 

An interesting example is given by Matzinger [Matzinger 2002] where 
she argued that to prevent transplant rejection it may be more useful to 
design drugs that blocks signal II and not signal I (which the present drugs 
do). The reason is blocking signal II make the effectors (which originally 
were capable of recognizing the transplant) anergic while leaving the other 
immune effectors intact. 

5 Conclusions 

(i) CAS should be studied as a whole hence reductionist point of view may 
not be reliable in some cases. 

(ii) CAS are open with nonlinear local interactions hence: 

1. Long range prediction is highly unlikely [Strogatz 2000, Holmgren 
1996]. 

2. When studying a CAS take into consideration the effects of its 
perturbation on related systems e.g. perturbation of lake Victoria 
has affected mosquitoes' numbers hence the locals quality of life. 
This is also relevant to the case of natural disasters where an 
earthquake at a city can cause a widespread power failure at other 
cities. 

3. Expect side effects to any "WISE" decision. 
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4. Mathematical and computer models may be helpful in reducing 
such side effects. 

(hi) Optimization in CAS should be multi-objective and not single objective 
[Collette and Siarry 2003]. 

(iv) CAS are very difficult to control. Interference at highly connected sites 
may be a useful approach [Dorogovtsev and Mendez 2004]. The inter- 
linked nature of CAS elements complicates both the unpredictability 
and controllability problems. It also plays an important role in inno- 
vations spread. 

(v) Memory effects should not be neglected in CAS. This lends more sup- 
port for the proposed telegraph reaction diffusion Eq. (3). Also mem- 
ory games have been studied [Smale 1980, Ahmed and Hegazi 2000]. 
Also delay and fractional calculus are relevant to CAS. 

(vi) Mathematical topics motivated by CAS include ODE and PDE (non- 
autonomous, delayed, periodic coefficients, stability and persistence), 
multi-objective optimization (including biologically motivated methods 
e.g. Ant colony optimization, Extremal optimization, Genetic algo- 
rithm etc), difference equations, cellular automata, networks, fractional 
calculus, control (e.g. bounded delayed control of distributed systems), 
game theory, nonlinear dynamics and fuzzy mathematics. 

Some of the mathematics motivated by CAS will be reviewed in the ap- 
pendices. 
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Appendix (1): Some mathematical results for 
one-dimensional cellular automata 

Definition (3): A cellular automata consists of 4 components: A graph G, 
a set of states such that each site (vertex) of the graph has one of the possible 
states, a neighborhood set which assigns to each vertex a certain neighbor- 
hood and a transition function / which defines the evolution of the state of 
each site as a function of the states of that site and those in its neighborhood. 

We choose the set of possible states to be the ring Z(p) i.e. the set of 
integers 0, 1, 2, ...,p — 1, where addition is defined mod p. The total number 
of sites is denoted by N. In most of the cases, we choose N, p to be relatively 
prime. The set of states of the sites at a given time is called a configuration. 
We now restrict us to a one-dimensional space. Let x(j, t) be the state of site 
j at time t. 

Definition (4): A finite initial configuration is one such that there are 
two natural numbers L, R such that < L < R < N, and x(j, 0) = if 
j < L or j > R. 

Theorem (1) [Jen 1990]: If x(i,t), x(j,t), % < j are two periodic sequences 
i.e. x(i,t) = x(i,t +p(i)), x(j,t) = x(j,t +p(j)), then for every k such that 
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i < k < j then x(k, t) is periodic. 

Corollary (1) [Jen 1990]: If CA evolves according to the rule 

x(i, t) = f(x(i - 1, t),x(i, t),x(i + 1, t)) mod 2, (4) 

such that 000 — > 0, 100 — > 0, 001 — > 0, then for any finite initial configuration 
the system is temporarily periodic i.e. the sequence (x(i,t)) is periodic for 
all % such that <i< N,0 <T <t. 

Proof. The fact that 100 — > implies that x(i, t) = for i > R, simi- 
larly x(i,t) = for i < L for all t > 0. Applying theorem (1) the result is 
proved. 

In the case that / in Eq. (4) is linear, one can use the methods of [Stevens 
et al 1993, Tadaki 1994] to get useful information about possible periodicity's 
of the system. In this case the system can be written as 

X(t + l) = UX(t), (5) 

where U is called the evolution matrix. Then X(t) = U t X(0). In this 
case the asymptotic behavior of the system is governed by the characteristic 
polynomial of U on the field Z(p). Assuming periodic boundary conditions, 
the matrix U is circulant matrix [Barnett 1990]. 

Let P(N, A) be the characteristic polynomial of the system (5) with 
sites, then typically it has the form 

P(n,X) = X a d(n,X), d(n,0) = 1. (6) 

If a > 0, then the systems tends to a fixed configuration (which corresponds 
to a fixed point for discrete time continuous state dynamical systems). Re- 
ducing d(n, A) to its irreducible factors on the field of states then in most 
cases a cycle of length p k — 1 exist for the system where k is the degree of 
the irreducible factors. 

As an example consider rule 90 [Martin et al 1984] 

x(i, t + l)=x(i-l,t)+x(i + 1, t) mod 2. (7) 

For N = 5, we have P(5, A) = A(A 2 + A + l) 2 mod 2, hence the system may 
evolve to a fixed configuration (e,g, x(i,t) = for all t > T > 0, for all 
< i < N). It can also evolve to a cycle of period 3 (= 2 2 — 1). 
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Similarly for N = 9, P(9, A) = A(A + 1) 2 (A 3 + I + l) 2 on Z(2). Hence this 



system may evolve to a fixed configuration or to a periodic one with period 
7. For N = 13, similar study implies that P(13, A) = A(A 6 + A 5 + A 4 + A + l) 2 
on Z{2). Hence fixed configurations and periodic ones with period 63 are 
expected. Such long periods may not be easy to find numerically. These 
results can be obtained using more elaborate methods [Martin et al 1984]; 
but the simplicity of the present approach is appealing. 

Moreover it is directly applicable to nonlocal cases which have gained 
much attention after the pioneering work of Watts and Strogatz on small 
world network (SWN) [Watts and Strogatz 1998]. As an example consider 
the following system 



where k is fixed. Some of the characteristic polynomials P(N,A,k) are: 



Hence we have the following proposition: 
Proposition (1): a) The system (8) depends on k. 

b) The asymptotic behavior of (8) contains the following: For N — 11, k — 3, 
no fixed configuration but a periodic one with period 1023. 

Proof, a) For N — 11, k = 5, a homogeneous configuration is expected. 
This is not the case for N — 11, k = or k = 3. 

b) Use the procedure explained before. 

Typically updating of CA is synchronous. It is important to notice that 
other types of updating e.g. a uniform random asynchronous one (where 
only one site is chosen randomly and updated at each time step) gives other 
patterns [Schonfisch and de Roos 1999]. The following lemma is useful 

Lemma (1): a) States which are stationary under synchronous updating 



x(i, t + 1) = x(i — 1, t) + x(i + 1, t) + x(i + k, i) mod 2, 



(8) 



P(11,0,A) 
P(11,3,A) 
P(11,1,A) 



A 11 + A 10 + A 5 + A 4 + A + 1, 

A 11 + A 9 + A 7 + A 6 + A 5 + A 4 + A + 1, 

A n + A 8 + A 7 + A 5 + A 2 + 1, 



(9) 
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are also stationary under asynchronous one. 

b) If there is a site j which is not updated for all time t > T > then 
stationary configuration with respect to asynchronous updating may not be 
so under synchronous one. 

Proof, a) lif(x(l),x(2),...,x(N)) = (x(l), x(2), x(N)) then f(j,x(j)) = 
x(j). This proves part a). Since site j is not updated for t > T > then 
f(j, x (j)) X U) can s tih belong to a homogeneous configuration for the 
asynchronous updating but not the homogeneous one. This proves b). 

Loosely speaking patterns present in asynchronous updating are mostly 
present in synchronous one. Motivated by these results we study sequential 
CA e.g. the sequential rule 90 is 

x(j, t + l) = x(j - 1, t + 1) + x(j + 1, t) mod 2. (10) 

This can be written in the following equivalent form 

x(j, t + 1) = x ( k , t) mod 2, (11) 

k=2 

where free periodic boundary conditions are assumed. The characteristic 
polynomials of the system (7) are: 

P(5,A) = A 5 + A 3 , P(6, A) = A 6 + A 5 + A 3 , P(7, A) = A 7 , 
P(13,A) = A 7 (A 3 + A 2 + 1) 2 . 

Hence homogeneous configurations are expected for N = 5,6, 7. For N — 13 
a periodic configuration with period 7 is expected. 

Studying the system (10) numerically showed that chaos (in the sense of 
sensitive dependence on initial conditions which is sometimes called damage 
spread) exists. 

Proposition (2): Every initially finite configuration will evolve under the 
CA 

x(j, t + l) = x(j - 1, t + 1) x(j + l,t) mod 2, (12) 

into the zero configuration x(j,t) = for all j, < j < N, for all time 
t > T > where T < N. 
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Proof. We have 

x{R, 1) = x(R + 1, 0) x(R -1,1). 

But 0) = by definition of initially finite configuration thus x(R, 1) = 

0. Repeating for x(R — 1, 2), one gets — 1, 2) = and continue. 

Now the above results are applied to two known examples. The first is 
Domany-Kinzel (DK) model [Kinzel and Domany 1984], which is given by: 

If x(j -l,t)+x(j + l,t) = then x(j, t + 1) = 0. 
If x(j - l,t)+x(j + l,t) = 1 then + 1) = 1, 

with probability p\. (13) 
If x(j - l,t) +x(j + l,t) = 2 then * + 1) = 1, 

with probability p 2 - 

where x(j,t) are Boolean variables. For — > 1, p 2 — * 1, the system (13) 
corresponds to the CA 

t + 1) = x(j - 1, t) + x(j + l,t) + x{] - 1, t) x(j + l,t) mod 2. (14) 

Proposition (3): Any finite initial configuration with two consecutive ones 
will tend to the homogeneous configuration x(j, t) — 1 for all < j < N, 
< T < t, T is sufficiently large under the CA (12). Consequently the region 
Pi — > 1, p 2 — > 1 in the DK CA does not show chaos (damage spread). 

Proof. Assume that x(j, 0) = x(j + 1,0) = I. Then the system (14) implies 
x(J -1,1) = 1) = x(j + 1, 1) = x(j + 2, 1) = 1. 

Continue one gets after t time steps t) = 1, where j — t < k < j + 1 + 1. 
This proves the first part. Now since the CA (14) will tend to x(j,t) = 1 
for all < j < N, < T < t, then any change in the initial conditions that 
preserves the condition x(j, 0) = x(j + 1, 0) = 1 for some j will not affect the 
asymptotic behavior of the CA (14). This completes the proof. 
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The case pi — > in the DK model corresponds to the CA 

x(j, t + 1) = x(j - 1, i) x(j + 1, t) mod 2. (15) 
Following similar steps as those in proposition (3) one can prove the following: 

Proposition (4): Any finite initial configuration with two consecutive zeros 
will tend to the homogeneous configuration x(j,t) = for all < j < N, 
< T < t, T is sufficiently large under the CA (15). Consequently the 
region p\ — > 1, p 2 — > 1 in the DK CA does not show chaos (damage spread) 
or periodic configurations. 

In the limit p 2 — > 0, p 1 — > 1, DK model corresponds to rule 90 

a;(j, f + 1) = x (j - 1, t) + x(j + 1, t) mod 2, 

which is known to be chaotic. 

All of the above results agree with numerical simulations. Bagnoli et al 
model [Bagnoli et al 2002] is given by 

If x(j-l,t)+x(j,t)+x(j + l,t) = then x(j, t + 1) = 0. 

If x(j - l,t) +x(j,t) +x(j + l,t) = 1, 

then x(j, t + 1) = 1 with probability p±. 

Ux(j-l,t)+x(j,t)+x(j + l,t) = 2, (16) 

then x(j, t + 1) = 1 with probability p 2 - 

If x(j - l,t) +x(j,t) +x(j + l,t) = 3 then x(j, t + 1) = 1. 

where x(j,t) are Boolean variables. The limit p± — > 1, p 2 — > 1 corresponds 
to the CA 

x0',t + l) = x(j-l,t)+a;(j + l,t)+x(j,t)+x(j-l,t)a;(j + l,t) + 
x(j, t) x(j + 1, *) + x(j - 1, t) + (17) 
x(j - 1, t) t) x(j + 1, t) mod 2. 

The limit pi — * 0, p 2 — * corresponds to the CA 

t + 1) = x(j - 1, t) x(j + 1, mod 2. (18) 

Proposition (5): a) Any nonzero finite initial configuration will evolve 
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under the CA (17) into the homogeneous configuration x(j,t) = 1 for all 
< J ; < N, for t is sufficiently large. Hence the limit p\ — > 1, p 2 — > 1 in 
Bagnoli et al model does not show chaos or periodic configurations. 

b) Any finite initial configuration containing at least one zero site will evolve 
under the CA (18) into the homogeneous configuration x(j, t) = for all 
< J ' < N, for t is sufficiently large. Hence the limit p± — > 0, j>2 - *• in 
Bagnoli et al model does not show chaos or periodic configurations. 

Proof, similar to proposition (3). 

The limit p\ — > 1, p 2 — > corresponds to the CA 



which is similar to rule 150 x(j, t + 1) = x(j — l,t) + x(j + l,t) + x(j,t), 
hence chaos is expected in Bagnoli et al model in this limit. All of the above 
results agree with numerical simulations. 

It is interesting how CA unite polynomials on finite fields, circulant ma- 
trices, graph theory techniques and many other branches of mathematics into 
one branch which is important both mathematically and from the point of 
view of applications in complex systems. 



Appendix (2): Overview of networks in CAS 



Complex systems are often modeled as graphs where agents are the vertices 
and the interactions form the edges of the graph. Typically graphs are either 
regular lattices (e.g. square or cubic), random or scale free where the proba- 
bility that a vertex has degree k is p(k) « k^ 1 . Most of the real networks are 
of the scale free type. Some proposed mechanisms for this fat tailed distri- 
bution [Dorogovtsev and Mendes 2004] are self organization (c.f. biological 
systems) and optimization involving many agents (c.f. economy). 

Random graphs were first studied by the mathematicians Erdos and Renyi 
[Erdos and Renyi I960]. Their model consists of iV nodes, such that every 
pair of nodes is connected by a bond with probability p. The recent increase 
in computing power and the appearance of interdisciplinary sciences has lead 
to a better understanding of the properties of complex networks. 



x(j,t + l) 



x(j -l,t) + x(j + l,t)+x(j,t) + 
2x(j-l,t) x(j,t) x(j + l,t) mod 2 



(19) 
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Two main properties of complex networks are clustering and small world 
effect. 

Small-world effect means the average shortest node to node (vertex to 
vertex) distance is very short compared with the whole size of the system 
(total number of vertices). For social networks, the social psychologist Mil- 
gram [Milgram 1967] concluded that the average length of the path of ac- 
quaintances connecting each pair of people in the United States is six. This 
concept is known as the six degrees of separation. Such an effect makes it 
easier for an effect (e.g. an epidemic) to spread throughout the network. 

In a regular 1-dimensional lattice of size N, the average shortest path con- 
necting any two vertices / increases linearly with the system size. So regular 
lattices do not display small- world effect. On the other hand for a random 
graph, with coordination number z, one has z first (nearest) neighbors, z 2 
second neighbors and so on. This means that the total number of vertices 
N = z l , this gives 

, _ MAT) 
Ln(z) ' 

The logarithmic increase with the size of the lattice allows the distance I to 
be very short even for large N. Then random graphs display the small-world 
effect. 

Clustering is a common property of complex networks. It means that ev- 
ery vertex has a group of connected nearest neighbours (NN) (collaborators, 
friends), some of them will often be a connected NN to another vertex. As 
a measure for the clustering property, a clustering coefficient C is defined as 
the probability that connected pairs of NN of a vertex are also connected to 
each others. For a random graph, C = z/N which goes to zero for large N. 
So random graphs do not display clustering property. On the other hand, a 
fully connected regular lattice itself forms a cluster, then its cluster coefficient 
is equal to 1. 

Complex networks display a small-world effect like random graphs, and 
they have large clustering coefficient as regular lattices. For a review on 
many real- world examples, see [Dorogovtsev and Mendes 2004]. 

A small-world network (SWN) proposed initially by Watts and Strogatz 
[Watts and Strogatz 1998] is a superposition of a regular lattice (with high 
clustering coefficient) and a random graph (with the small world effect). 
SWN satisfy the main properties of social networks. Also, the structure of 
SWN combines between both local and nonlocal interactions which is ob- 
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served in many real systems. For example epidemic spreading show nonlocal 
interactions e.g SARS. 

The concept of SWN has been applied successfully in modelling many 
CAS, e.g, some games [Ahmed and Elgazzar 2000 a], epidemics [Ahmed et. 
al. 2002], economic systems [Elgazzar 2002], and opinion dynamics [Elgazzar 
2001]. 

An important property related to disease spread in a network is the second 
moment of the degree distribution i.e. (k 2 ). If it is divergent then on average 
a vertex has an infinite number of second nearest neighbors thus if a single 
vertex is infected the disease will spread in the whole network. This explains 
the results that disease spread on scale free networks has zero threshold 
(contrary to the ODE and PDE models). However one should realize that 
real networks are finite hence a kind of threshold is expected. 

Scale-free networks [Albert and Barabasi 2002] are another class of com- 
plex networks. A scale-free network does not have a certain scale. Some nodes 
have a huge number of connections to other nodes, whereas most nodes have 
only a few, following a power law distribution. 

Appendix (3): Basics of game theory 

Game theory [Hofbauer and Sigmund 1998] is the study of the ways in which 
strategic interactions among rational players produce outcomes (profits) with 
respect to the preferences of the players. Each player in a game faces a 
choice among two or more possible strategies. A strategy is a predetermined 
program of play that tells the player what actions to take in response to every 
possible strategy other players may use. A basic property of game theory is 
that one's payoff depends on the others' decisions as well as his. 

The mathematical framework of the game theory was initiated by von 
Neumann and Morgenstern in 1944. Also they had suggested the max-min 
solution for games which is calculated as follows: Consider two players A 
and B are playing against each other. Two strategies Si, S 2 are allowed for 
both of them. This game is called two-player, two-strategy game. Assume 
that the constants a, b, c and d represent the payoffs (profits) such that, if 
the two players use the same strategy Si (S2), their payoff is a(d). When a 
player with strategy Si plays against another one with strategy S2, the payoff 
of the Si-player is b and the payoff of the S2-player is c and so on. This is 
summarized in the payoff matrix as follows: 
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Si S2 

51 a b . 

52 c d 

The max-min solution of von Neumann and Morgenstern is for the first player 
to choose max{min(a, 6),min(c, d)}. The second player chooses min{max(a, c), 
max(fc, d)}. If both quantities are equal then the game is stable. Otherwise 
use mixed strategies. 

A weakness of this formalism has been pointed out by Maynard Smith in 
the hawk-dove (HD) game whose payoff matrix is 

H D 

II = H \{v-c) v . 
D I 

The max-min solution implies (for v < c) that the solution is D yet as he 
pointed out this solution is unstable since if one of the players adopts H in a 
population of D he will have a very large payoff which will make other players 
switch to H and so on till number of H is large enough that they play each 
other frequently and get the low payoff (v — c)/2. Thus the stable solution 
is that the fraction of hawks should be nonzero. To quantify this concept 
one may use the replicator equation which intuitively means that the rate of 
change of the fraction of players adopting strategy i is proportional to the 
difference between their payoff and the average payoff of the population i.e. 

dxi n 

— - = Xi [(Ux)i - xllx] , i = 1,2, ...,n, = 1, (20) 

at i=i 

where Xj is the fraction of players adopting strategy i, and II is the payoff 
matrix. Applying Eq. (20) to the HD game, one gets that the asymptotically 
stable equilibrium solution is x = v/c, where x is the fraction of hawks in 
the population. 

For asymmetric game the replicator dynamics equation is 

dX{ d-Vi 

= Xi [(Uiy)i - xlliy] , = y t {{U 2 x)i - yU 2 x] , % = 1, 2, n. 

A basic drawback of normal game theory is the assumption that all players in- 
teract globally. It is more realistic to study local games [Ahmed and Elgazzar 
2000 b] e.g. games on a lattice where players interact only with their nearest 
neighbors. Also there are several modifications for game formulations. 
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Appendix (4): Unpredictability in CAS 



There are at least two sources for unpredictability in CAS. The first is that 
CAS are open systems hence perturbing a CAS may affect another related 
one e.g. the insect population affected by the perturbation of Lake Victo- 
ria. Another reason is the nonlinear interactions [Strogatz 2000] between the 
elements of the CAS. The scientific and mathematical study of Chaos The- 
ory contains many overlaps with the study of Complex Systems, but with 
differences related to method: Chaos Theory can be used to study Complex 
Systems, but is not restricted to the study of these systems. Chaos Theory 
"deals with deterministic systems whose trajectories diverge exponentially 
over time" (Bar Yam, NECSI website). It has been used to study Complex 
Systems, because these systems can be generally defined as a "deterministic 
system that is difficult to predict" . On the other hand, complexity deals with 
systems composed of many interacting agents" The point being that Chaos 
Theory is one of many tools and methods that can be applied to the study 
of Complex Systems, but is not specifically devoted to the way these sys- 
tems are designed, developed, studied, and modeled. That being stated, the 
famous example of the "Butterfly Effect" in a chaotic system is an example 
of an agent (a butterfly) evoking a non-linear response (the storm in New 
England) within a Complex System (Global Weather System). 

A simple example of nonlinear interactions is the logistic difference equa- 
tion 

x t+ i = rx t (l - x t ), t = 0, 1,2, ...,n, r > 0. (21) 

This equation has two equilibrium solutions x — 0, x — 1 — 1/r (r > 1) 
which are asymptotically stable ifr<lorl<r<3 respectively. If 
3 < r < 3.6 then cycles appears and if r > 3.6 chaos sets in. Intuitively 
chaos is sensitive dependence on initial conditions (for more mathematical 
definition see [Holmgren 1996]). Hence in chaotic systems one cannot make 
long range predictions c.f. weather. A useful measure of chaos are Lyapunov 
exponents 

X = -J2Ln f(x t ) . (22) 



n t=o 



Since CAS consists of several interacting agents one studies coupled systems 
e.g. coupled map lattices [Kaneko 1993] given by 



D 



x^ = (1 - D)f(xl) + - If(xU) + f(xl +1 )\ , i = 1,2, ...,n. (23) 
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The homogeneous equilibrium is given by x = f(x) and it is asymptotically 
stable if [Ahmed and Hegazi 2002] 



/(*) 



h"TC 

;i-d) + dcos(— ) 

n 



< 1, k — 0, 1, - 1. 



(24) 



The more realistic case is to assume that the map depends on the agents e.g. 



x 



t+i 



D 



l-D)Mx\) + - f^ixU) + f i+1 (xl +1 ) , i = l,2, ..,n. (25) 



But analytic studies for Eq. (25) are more difficult. 

These systems shed some light on how to control (synchronize) some CAS 
[Ahmed et al 2003]. One may increase the coupling constant D. Also if the 
network of the agents is more connected (e.g. SWN), then the system is 
easier to synchronize. Finally external control can be applied preferably at 
highly connected sites. 



Appendix (5): Elements of mult i- objective op- 
timization 

Almost every real life problem is multi-objective (MOB) [Collette and Siarry 
2003]. Methods for MOB optimization are mostly intuitive. 

Definition (5): A MOB problem is: 

Minimize (min)Zj(x), i = 1, 2, k, subject to g(x) < 0, h(x) < 0. (26) 



Definition (6): A vector x* dominates x if Z^x) < Zj(x)Vi = 1,2,..., A; 
with strict inequality for at least one i, given that all constraints are satisfied 
for both vectors. 

A non-dominated solution x* is called Pareto optimal and the correspond- 
ing vector Zi(x*), i = 1,2,..., A; is called efficient. The set of such solutions 
is called a Pareto set. 

Now we discuss some methods for solving MOB problems: 



23 



The first method is the lexicographic method. In this method objectives 
are ordered according to their importance. Then a single objective problem 
is solved while completing the problem gradually with constraints i.e. 



min Zi subject to 
g(x) < 0, h(x) = ' 



(27) 



then if ZMIN(l) is the solution, the second step is min Z2 subject to Z\ = 
ZMIN(l), and the constraints in Eq. (26), and so on. 

A famous application is in university admittance where students with 
highest grades are allowed in any college they choose. The second best group 
is allowed only the remaining places and so on. This method is useful but in 
some cases it is not applicable. 

Proposition (6): An optimal solution for the lexicographic problem is 
Pareto optimal. 

Proof. Let x* be the solution to the Lexicographic problem Thus 

x ^ x*, then Z^x) = Z^x*), % = 1, 2, I - 1 and Z^x*) < Z x {x). (28) 
Thus x* is not dominated. 

The second method is the method of weights. Assume that it is required to 
minimize the objectives Z(j),j = l,2,...,n. (The problem of maximization 
is obtained via replacing Z(j) by —Z(j). Define 



Then the problem becomes to minimize Z subject to the constraints. This 
method is easy to implement but it has several weaknesses. The first is that it 
is not applicable if the feasible set is not convex. The second difficulty of this 
method is that it is difficult to apply for large number of objectives. However 
it is quite effective for multiobjective problems with discrete parameters since 
in this case Pareto optimal set is discrete not a continuous curve. 

The third method is the compromise method (sometimes called e— constr- 
aint method P £ {k). In this case one minimizes only one objective while 
setting the other objectives as constraints e.g. minimize Z(k) subject to 
Z(j) < a(j), j — 2, 3, k — l,k + l, n, where a(j) are parameters to be 



k k 



Z = Y,Ziiu(i), < w(i) < 1, £>(i) = 1. 



(29) 



i=i i=i 
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gradually decreased till no solution is found. The problem with this method 
is the choice of the thresholds a(j). If the solution is unique, then this method 
is guaranteed to give a Pareto optimal solution. 

Proposition (7): If the solution is unique, then the e— constraint method 
is guaranteed to give a Pareto optimal solution. 

Proof. Let x* be the optimal solution for the e— constraint method then 

V x 7^ x*, then Z k (x*) < Z k (x), 

hence x* is Pareto optimal. If x* is not unique, then it is weakly Pareto i.e. 
there is no x ^ x* such that Zi(x*) < Z^xpj i = 1, 2, n. 

A fourth method using fuzzy logic is to study each objective individually 
and find its maximum and minimum say ZMAX(j), ZMIN(j), respectively. 
Then determine a membership m(j) = (ZMAX(j) - Z (j)) / (ZMAX(j) - 
ZMIN(j)). Thus < m{j) < 1. Then apply max{min{m(j), j = l,2„n}}. 
Again this method is guaranteed to give a Pareto optimal solution provided 
that the solution is unique otherwise it is weakly Pareto. This method is 
a bit difficult to apply for large number of objectives. A fifth method is 
Keeney-Raiffa method which uses the product of objective functions to build 
an equivalent single objective one. 

Appendix (6): Fractional calculus in CAS 

Recently [Stanislavsky 2000] it became apparent that fractional equations 
solve some of the above mentioned problems for the PDE approach. To see 
this consider the following evolution equation 




(30) 



If the system has no memory then k(t — t) = S(t — t) and one gets f(t) 
/oexp(— \ 2 t). If the system has an ideal memory, then 




hence / ~ / O cos (^)- Using Laplace transform 
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[•OO 

L[f}= / f(t)exp(-st)dt, 
Jo 



one gets L[f] = 1 if there is no memory and L[f] = 1/s if there is ideal 
memory hence the case of non-ideal memory is expected to be given by 
L[f\ = l/s a , < a < 1. In this case Eq. (28) becomes 

df(t) _ f it- t)^f(t)dt 
dt 'Jo r(a) ' [ ' 

where r(a) is the Gamma function. This system has the following solution 

f(t) = f E a+1 (-\H a+1 ), 
where E a (z) is the Mittag Leffler function given by 



E a (z) = E 



It is direct to see that Ei(z) = exp(z), E 2 (z) = cos(z) 

Following a similar procedure to study a random process with memory, 
one obtains the following fractional evolution equation 

g^^(^ W(,, t )i o<a<l| (32) 

where P(x, t) is a measure of the probability to find a particle at time t at 
position x. 

We expect that Eq. (30) will be relevant to many complex adaptive 
systems and to systems where fractal structures are relevant since it is argued 
that there is a relevance between fractals and fractional differentiation [Rocco 
and West 1999]. 

For the case of fractional diffusion equation the results are 

d^PM = D d*P(x,t) p(xQ)=5(x) dP(x,0) = 

P = —^M(^i--,p\,P=^±, (33) 

oo / i\n n 

For the case of no memory a = =3- M(z, 1/2) = exp(— z 2 /4). 
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